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Abstract. The number partitioning problem is a classic problem of combina- 
torial optimization in which a set of n numbers is partitioned into two subsets 
such that the sum of the numbers in one subset is as close as possible to the 
sum of the numbers in the other set. When the n numbers are i.i.d. vari- 
ables drawn from some distribution, the partitioning problem turns out to be 
equivalent to a mean-field antiferromagnetic Ising spin glass. In the spin glass 
representation, it is natural to define energies - corresponding to the costs of 
the partitions, and overlaps — corresponding to the correlations between parti- 
tions. Although the energy levels of this model are a priori highly correlated, 
a surprising recent conjecture asserts that the energy spectrum of number 
partitioning is locally that of a random energy model (REM): the spacings 
between nearby energy levels are uncorrelated. In other words, the properly 
scaled energies converge to a Poisson process. The conjecture also asserts that 
the corresponding spin configurations are uncorrelated, indicating vanishing 
overlaps in the spin glass representation. In this paper, we prove these two 
claims, collectively known as the local REM conjecture. 



1. Introduction 

The study of typical properties of random instances of combinatorial problems 
has recently been the focus of much interest in the theoretical computer science, 
discrete mathematics and statistical physics communities. Many of these problems 
turn out to be closely related to disordered problems in statistical physics DMSZOll 
IMez03| - a connection which has motivated a host of interesting conjectures. In 
this paper, we establish one of these conjectures: the local REM property of the 
random number partitioning problem (Npp). 

The non-random Npp is one of the classic NP-complete problems of combi- 
natorial optimization, closely related to other classic problems such as bin pack- 
ing, multiprocessor scheduling, quadratic programming and knapsack problems 
|GJ97[IACG"'"99| . In addition to its theoretical significance, the Npp has many ap- 
plications including task scheduling and the minimization of VLSI circuit size and 
delay |CL91llTsa92j . public key cryptography MH78, Odl91 , and, more amusingly, 
choosing teams in children's baseball games Hay02 . 

A fixed instance of the Npp is defined as follows: Given n numbers Xi,X2, 
. . . , Xm we seek a partition of these numbers into two subsets such that the sum 
of numbers in one subset is as close as possible to the sum of numbers in the other 
subset. Each of the 2" partitions can be encoded as cr G {—1, +1}"', where cr^ = 1 if 
Xi is put in one subset and ai = —1 ii Xi is put in the other subset; in the physics 
literature, such partitions cr are identified with Ising spin configurations. The cost 
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function to be minimized over all spin configurations cr is therefore the energy 



where we have inserted a factor 1 / ^/n to simplify the equations in the rest of the 
paper. 

Note that the spin configurations cr and —cr correspond to the same partition 
and therefore of course have the same energy. Thus there are N — 2"^^ distinct 
partitions and at most N distinct energies. The lowest of these N energies is the 
ground state energy of the model. The energy spectrum is the sorted increasing se- 
quence El, En of the energy values corresponding to these N distinct partitions. 
Let (t(^), . . . ,(t(^) be configurations corresponding to these ordered energies. The 
overlap between the configurations cr^*) and cr^^^ is defined as 



One often studies random instances of the Npp where the n numbers Xi, . . . , X„ 
are taken to be independently and identically distributed according to some density 
p{X). In most cases studied so far, the are taken to be drawn uniformly from 
a bounded domain, say integer values drawn uniformly from {1, . . . , 2"'} or real 
values drawn uniformly from [0,1]. The statistical mechanics of this model has 
been discussed by several authors |F\i89l fFF98 Mcr98, STNOl . 

When p{X) is the uniform distribution on {1,...,2™}, it turns out that the 
typical properties of random instances depend on the ratio n = m/n. Numerical 
simulations suggested that in the limit n, m — > oo with n fixed, this system had a 
sharp transition at k = 1 between a phase in which there are exponentially many 
optimal solutions with energy or 1, and a phase where the optimal solution is 
unique (except for trivial symmetry) and has energy scaling with 2" |GW96| . This 
was supported by a statistical physics approach jMer98| and confirmed by rigorous 



For the random Npp, the costs of two partitions cr and cr' are a priori highly 
correlated random variables. In (MerOO) . one of the authors made a rather surprising 
"random cost approximation," in which the correlations of energies near the ground 
state were neglected. Within this approximation, it is easy to calculate the statistics 
of the ground state and the first excitations. Remarkably, the results of these 
calculations were later confirmed by rigorous analysis |ljCP01| . which therefore 
suggested that there might be a mathematical basis for this approximation. 

Numerical simulation and heuristic arguments led to an even stronger conjec- 
ture, namely that the statistical independence of nearby levels is not restricted 
to energies close to the ground state but extends to all fixed "typical" energies 
jBFM04| . These authors also conjectured that the overlaps corresponding to these 
energies are uncorrelated. These two claims were collectively called the local REM 
conjecture _BFM04 , since the proposed behavior of nearby energies was analogous 
to that of the random energy model (REM) in spin glass theory |Der81| . In this 
paper, we prove the local REM conjecture for the Npp with a general distribution 
of the Xi. 




(1.1) 




(1.2) 



analysis |BCP01j . 
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In physical terms, the optimal partitions of the Npp are precisely analogous to 
the ground states of a mean-field antiferromagnetic Ising spin system with Mattis- 
like couplings = —XiXj defined by the Hamiltonian 



H{(t) - E^{a) - l■Y,X^XJ^,a, ^ J„a,a, . (1.3) 



n 



Similarly, the energy spectrum and overlaps of the Npp are analogous to those 
of the mean-field antiferromagnetic Mattis spin glass. Our results therefore also 
establish the REM conjecture for this spin glass. 

2. Statement of Results 

Let Xi,...,Xn be independent random variables distributed according to the 
common density function p{x). We assume that X has finite second moment and 
p{x) satisfies the bound 

/•oo 

p(a;)^+' dx <oo (2.1) 



for some e > 0. Note that this includes, in particular, all bounded density functions 
with finite second moment. We use the symbol Pn(-) to denote the probability with 
respect to the joint probability distribution of Xi, Xn ■ 

As in the introduction, wc represent the 2" partitions of the integers {1, ..,n} as 
spin configurations cr e { — 1,+1}", define the energy of er as in Hl.l|l . and denote 
by El, ... , En the increasing spectrum of the energy values corresponding to the 
N = 2"~^ distinct partitions. We also denote by (t'^^\ . . . , cr^^^ the configurations 
corresponding to these ordered energies. 

Finally, we introduce the rescaled overlaps as follows. Consider the random 
variable r„ defined by the condition Er^ < a < Er^+i- For j > i > 0, the rescaled 
overlap is defined by 

1 " 

^ s — 1 

On the basis of both heuristic arguments and numerical evidence, Bauke, Franz 
and Mertens (13FM04j conjectured the following behavior for the energy level and 
overlap statistics of the Npp with the Xi uniformly distributed in [0, 1]: 

Conjecture 2.1. Let Xi, X2, ■ . . , Xn he i.i.d. random variables distributed uni- 
formly in [0, 1], let a > be a fixed real number, and let I be a fixed positive integer. 
Define r by Er < a < Er+i. Then 

|2«~le-3"V2(^^^^ _ _ ^^^^ _ ^) 

=> {UJi,Wi + W2, ...,Wi +W2 + h VUi), 

where Wi are i.i.d. random variables each distributed exponentially with mean 1, 
and ^ denotes weak convergence as n —> 00. In addition, spin configurations 
corresponding to different energy levels become asymptotically uncorrelated in the 
sense that, for all j > i > 0, the rescaled overlap Qij converges to a .standard 
normal. 



For a = 0, the part of the conjecture concerning the energies was already rig- 
orously established in |BUP01| . In this paper we prove that the full Conjecture 
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12.11 for fixed a > holds not only for the uniform distribution, but also for any 
distribution which has finite second moment and satisfies (|2.1|l . 

Theorem 2.2. Let p he a probability density on [— cxd, cxi] with finite second moment 
T^. Assume that p satisfies the condition (|2.1() for some e > 0, let I he a fixed positive 
integer, and let a he a fixed real number. If Xi, are independent random 
variables distributed according to p, and r„ is defined so that Er„ < a < £'r„+i, 
then 

J^2"-ie-"'/(2-')(i?.„+i - a, Er„+2 - a, Er„+i - a) 

TTT^ (ZA) 
=J> {Wi,Wi + W2, Wi + W2 + ■ ■ ■ +Wl) 

where wi are i.i.d. random variables each distributed exponentially with mean 1, 
and =^ denotes weak convergence as n —> oo. In addition, spin configurations 
corresponding to different energies become asymptotically uncorrelated in the sense 
that for any fixed j > i > 0, the rescaled overlap Qij converges in distribution to a 
standard normal, i.e., 

I 2 

lim P„(Qi, > /3) = / -^e~~dx. (2.5) 

n^oo Jg J2-K 



Remark 2.3. 

(1) Substituting = ^ for the case when Xi is distributed as J7[0, 1], observe 
that Theorem 12 . 21 reduces to Conjecture 12. II 

(2) Having established the original REM Coniecture l2.1l the question naturally 
arises whether analogous results hold for energy scales a which grow with n. 
Indeed, the authors of |BFMn4| said that they believe that the conjecture 
might extend to values of a that grow slowly enough with n, although 
computational limitations prevented them from supporting this stronger 
claim by simulations. In a second paper IBCMNOS] , we will show that, 
under suitable additional assumptions on the distribution p, the conjecture 
does indeed hold provided a = o(n^/'*). 

In addition to immediately implying the analogous results for the mean-field 
antiferromagnetic Mattis spin glass (see equation Hl.Hfl l. our theorem on the en- 
ergy spectrum of the NPP also gives the energy spectrum of the one-dimensional 
Edwards- Anderson (1-d EA) spin glass model away from the ground state. The 
1-d EA model has energy E{a) = J2i •^i'^i'^i+i- Consider the transformation 
Ti = CTi(Ti+i and take the boundary condition (T„+i = 1. Then E(a) = J2i Ji'i'ij so 
that, up to a multiplicative factor of y/n, the energy of the Npp with random vari- 
ables Xi is the same as the absolute value of the energy of the the 1-d EA model with 
random variables J^. Note that the energy spectrum of the Npp lies in [0, i^max], 
with iJmax = 0(y/ri), whilc that of the 1-d EA model lies in [— Vn^'max, v^^'max]- 

Our theorem says that properly scaled energies of the Npp converge to a Poisson 
process. By the above transformation, this result obviously applies also to the 
1-d EA model except for energies about zero, which are correlated by symmetry. 
In particular, the result applies to the 1-d EA model in energy intervals of the 
form [y^a, ^/n{a + ^(e^"))] for any bounded a > or their reflection about 0. 
If the interval includes the origin as an internal point, the positive and negative 
energies separately converge to Poisson processes, with the two obviously related 
by a spin-flip symmetry. 
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3. Proof of Theorem 12.21 

3.1. Outline of the Proof. Before we proceed with our proof, note that we may 
assume without loss of generality that the second moment is equal to 1 and that 
p is symmetric, p{x) — p{—x). Indeed, considering the rescaled random variables 
Xi = T^^Xi, we immediately see that the statements of theorem for general r follow 
from those for r = 1. Next, consider the random variables Yi, y„ where each Yi 
is obtained as Xi w.p. ^ or —Xi w.p. ^. It is easy to see that the energy spectrum 
of the Yi, ...,y„ is identical to that of the Xi, ...,X„. Further, from convexity of 
|a;|^^' and Jensen's inequality, it follows that pviv) = ^{p{x) + p{^x)) also satisfies 
()2.1|l . Therefore, w.l.o.g. we can assume that p{x) — p{—x) as claimed, and in 
particular that Xi has zero first moment. For simplicity of notation, we omit the 
subscript n, and denote the probability with respect to the joint distribution of 
Xi, . . . ,Xn by P(-)j and the expectation with respect to this distribution by E(-). 

Let Z„(a, 6) be the number of points of the energy spectrum that lie in the 
interval [a, 6], and let Nn{t) be the number of points in the energy spectrum that 
fall into the (shifted and) re-scaled interval [a, a + t^„], where 

- y|2-("-i)e"^/^ (3.1) 

We must show that Nn{t) converges to a Poisson process with parameter one. To 
this end, we will show that for any family of non-overlapping intervals [ci, di], . . . , 
[cm,dm] with di > Ci > 0, the rescaled variables Z„(a^, bJJ with a\ = a\- Ct^n and 
6^ = a -f di^n converge in distribution to the increments of a Poisson process with 
parameter one. We prove this by showing the convergence of the multidimensional 
factorial moments, i.e., by proving the following theorem. 

Theorem 3.1. Let a > 0, let m be a positive integer, and let [ci, di], . . . , [cm, dm] 
be a family of non- overlapping intervals. For £ = 1, . . . , to, set af^ — a + C£^„ and 
~ Q^ + '^fC"; where ^„ = ■^/^2^'^"^"')e" Given an arbitrary m-tuple (fci, .., km) 
of positive integers, we then have 

m m 

hm nX{{Z^KX))kA = X{{dt - Cip, (3.2) 

where, as usual, {Z)k = Z{Z — 1) . . . (Z — fc + 1). 

Theorem 13.11 establishes that Nn{t) converges to a Poisson with rate one. As 
we will see below, the asymptotic independence of configurations corresponding to 
nearby energy levels is an immediate corollary to the proof of this theorem. 

3.2. Integral Representation. Following the general strategy employed in Sec- 
tion 6 of BCPOj, we base the proof of Theorem 13. II on an integral representation 
of the factorial moments. The derivation of this integral representation uses the 
Fourier transform of rect(a;), where as usual rect(a;) is defined by 

J 1 -\<x<\ 
rect(x) <y Q otherwise. (^■'^) 
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Let — denote the center of the interval [a^, 6^], let — di — cg, and let 

<ln,i — li£,n\/n. Then Zn{a^, 6^) can be written as 

Z„(al,6l)= V/^H (3.4) 



where 

(3.5) 



^ ' 2 



rect(^"=^^-^-"^"^) + rect(^"-^"-^-+'"^) 



Note the factor i, which arises from the fact that each partition is counted only 
once in Z„(a^, 6^), while the two configurations cr and — cr correspond to the same 
partition of {1, . . . , n}. 

Next we write the k^^ factorial moment of Zn{a'^, 6^) as a sum over sequences of 
k distinct configurations (t^^\ . . . jtr^^'. To this end, let us first note that I'^^^cr) is 
either or 1/2, implying that 

/W(^) = /W(^)(/W(^) + /W(-^)) (3.6) 

for all (T e { — 1, +1}". Using this fact, we now rewrite the fc**^ factorial moment as 

E[(Z„(al,6l)),.]^ E[4^)(^W,...,^(^-))] (3.7) 

where the sum runs over distinct configurations and 

/f(<TW,...,<T(^-))=n/W(.w), (3.8) 

with /(^)(-) given by 

To obtain a formula for the multi-dimensional factorial moments, let us con- 
sider two disjoint intervals [a^, b^] and [a^ , and two sequences of configurations 
<t(i), . . .,cr(^'^ and*(i), . . . contributing to {Zn{ai,bi))k, and (Z„(al', 61'))^^, , 

respectively. Recall that the energy of the configuration cr is equal to the energy 
of the configuration — cr, E{(t) = E{—(t). Since I^^^cr) — unless E{(t) e [af^,b^] 
and /(^')(5-) = unless E{a) e K',6l'], we see that /(^) (cr)/(^') (ct) = if cr and 
a are not distinct. The combined sequence cr^^), . . . , crC^'^), cr^^-* , . . . ,cr^'°*'-' there- 
fore only contributes to the product {Zn{a'^,bl^))ke{Zn{al^,b^))k^, if cr^^^ 7^ ±cr'-^ 
for all £ £'. As a consequence, the multi-dimensional factorial moment in The- 
orem is itself given as sum over sequences of pairwise distinct configurations. 
More explicitly, let k = J2T=i ^^"^ ^ ~ I, . . . ,k, let £{j) = 1 if j = 1, . . . , fci, 
£{j) — 2 ii j — ki + 1, . . . , ki + k2, and so on. Then 

m 

E[l[{Z^{albi)),,]= J2 E[4(<tW,...,<t«)] (3.9) 

e=l ±o-(l)^--/±cr('') 

where 

4(^«,...,^('=))=n/w^))(^(^")). (3.10) 
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Using the Fourier inversion theorem and the fact that the Fourier transform of 
the function rect(x) is equal to 

. , f. sin tt/ 
smc(/) = —7-. 

we then rewrite /'^^•'(cr) as 

/oo 
sinc(/<7„,,) cos(27r/ilv^)e2"-^5::.i -.^.^^^ ^3^^^^ 
-00 

leading to the representation 

E[Z„(al,6fj] = qn,iY.A \ sinc(/9„,,)cos(2^/t^V^)e2"^S?=i'^=^=d/ 

(3.12) 

and a similar representation for the factorial moments. As we will see, the expec- 
tation and the integral in 13.12|1 can be interchanged, leading to a representation 
of the first moment, E[Z„(afj, 5^)], in terms of the Fourier transform 

p(/)=E[e2-/^]. (3.13) 

In a similar way, the factorial moments can be expressed in terms of the Fourier 
transform p of of the distribution function p. 

We will use several properties of the Fourier transform in our proof, which we 
summarize now. All of them follow from the fact that the density p{x) has finite 
second moment, satisfies equation ()2.1|l . and is symmetric under the transformation 
X —X. 

(i) For any pi > 0, there exists Ci > 0, possibly depending on pi, such that 

whenever |/| > pi,we have |/5(/)| < e~''^ . 
(m) For any n > Uq, where uq is the solution of + TI~ ^ ^ with e as in H2.1|l . 
we have 



mT< / \Kf)r = Co < 00. (3.14) 

— 00 J —oo 

{Hi) p{f)^0 as l/Hoo . 

(iv) There exists a C2 > such that, for ^ui > small enough, whenever |/| < pi, 
we have \p{f)\ < e^"^^^ . 

3.3. First Moment. In this subsection, we calculate the first moment of the ran- 
dom variable Z„(a^, bfj. To avoid very cumbersome notation, we omit the index £ 
in this subsection, and write a„, 6„, 7 and q„ for a^, je and qn,e, respectively. 

We first show that we can exchange the expectation with respect to p with the 
integral in H3.12|l . This is the content of the following lemma. 

Lemma 3.2. For all n > 1, 

/oo 
sinc(/g„) cos(27r/i„V^)p„(/)d/. (3.15) 
-00 

Proof. We use truncation to justify the interchange of the integral and the expec- 
tation in equation (|3.12() . For any B > 0, define 

Z(^^)(a„,6„) =g„ V / sinc(/g„)cos(27r/i„V^)e2-^/S"=i^=^=d/. (3.16) 

_ J-B 
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Observe that this corresponds to computing the inverse Fourier transform after 
truncating the Fourier transform in the region [—B, B]. Thus we can write 

4^^^K,M-^E/M ^"""^''^"'"'^ )+/^( ^"""^"'^^'"'^ )- (3-17) 

^ Qn Qn 

where fs (x) is the appropriately defined inverse Fourier transform for the truncated 
integral It is known that /^(a;)^ rect(x) as B^oo for aU x ^ ±i. At a: = ±i, 

fsix)—*^- Since Z^^\an,bn) is a finite sum we see that zlr^\an,bn) converges 
almost surely to Z„(a„,6„). The bounded convergence theorem then implies 

E[Z„K,6„)] = lim E[4^^)(a„,5„)]. (3.18) 

Using Fubini's theorem, independence of the Xi and the fact that aiXi has the 
same distribution as Xi , we may express the expectation of the right hand side of 
(|TT^ as 

E[Z(^^)(a„, 6„)] - 2>„ / sinc(/g„) cos(2^/t„V^)E[e2-^-^^]"d/ 

,B (3-19) 



= 2"(j„ / sinc(/g„)cos(27r/t„V")/5n(/)d/- 

J-B 



We now combine Holder's inequality with the bound H3.14|) and the fact that 
sinc(/(7„) is in L^+^ for all e > to bound 

i^B poo 

|sinc(/g„)cos(27r/t„Vn)/5„(/)|d/ < / | sinc(/g„)ll/5(/)M/ 
B J-oo ^3 20) 

<(/ |sinc(/g„)|i+'d/) (/ \pif)rdf) < ^. 

By dominated convergence, the integral in H3.19|l therefore converges as _B— >cx), 
giving 

E[Z„(a„,6„)] = lim E[z(^^)(a„, 6„)] 

r r- (3.21) 

= 2"g„ / smc{fqn) coa{2n ftny/n)pn{f)df, 

J — oo 

as required. □ 

Having established the integral representation (|3.15|) for E[Z„(a„, &„)], we are 
now ready to prove the convergence of the first moment. 

Lemma 3.3. 

lim E[Z„(a„,6„)] =7. (3.22) 

n — >oc 

Proof. The proof is a standard application of well-known techniques from asymp- 
totic analysis. However, for the convenience of the reader, and due to the fact that 
our later proof for higher factorial moments relies on some of the techniques used 
here, we present the full details. 

The proof proceeds in two steps. In the first step we show that, at the cost of 
an error which is negligible as n — > oo, the integral in H3.15|l can be restricted to a 
small neighborhood of with the sine factor replaced by 1, and in the second step 
we expand p"(/) around to get a Gaussian approximation for the integral. 
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We first show that, given any sequence w„ with — > oo and a;„/i/ri ^ as 
n — > cx), we have 



cos(2^/aV^)/5„(/)d/ + o(l). (3.23) 

To this end, let us first restrict the integral in H3.15|l to |/| < /ii. Since |/0„(/)| < 
|p"o(/)|e~'^i'^"~"°'' when n > hq and |/| > /ii, the contribution from |/| > /ii to the 
integral on the right hand side of (|3.15|) is exponentially small in n for large n. In 
the interval |/| < /ii, we can replace sinc(/(7„) by 1 + 0{{fqnY) as n ^ oo, since 
(7„ — !■ as n ~> oo. Observing that tn\/n = oi'^/n + 0(\AiCn) = oi\pn, + 0(g„), we 
can also replace cos(27r/t„-yn) by cos(27r/Q;\/n) + 0{fqn). Replacing sinc(/(7„) by 
1 and cos{2TTftn\/n) by cos(27r/Q;-yn), we thus incur an error that can be bounded 
by 2"0((7^) = o(l), giving the bound 

E[Z„(a„,6„)] = 2V P cos{27rfaV^)pn{f)df + o{l). (3.24) 

To complete the proof of H3.23|l , we have to show that 

lim 2"<7„ / cos(27r/aV^)p„(/)d/ = 0. 

J^n>\f\>^ 

Recall by property (iv) following (|3.14|l that p{f) < e~'^^^ , which implies 

/ cos(2^/«V^)p„(/)d/ < / e-^-"^'(i/ = 0(^e-^^-"). 

Since -^2"g„e~''^"" ^ as n ^ oo, this completes the proof of H3.23|l . 

To evaluate the integral in (|3.23(l , we make a Gaussian approximation in a neigh- 
borhood near origin. While this is standard if one assume that the density p has 
a sufficiently high moment (anything more than the second moment is enough), a 
little care is needed due to the fact that we only assume existence of the second 
moment. 

We start by choosing the sequence w„. Since p has a finite second moment, 
its Fourier transform is twice continuously differentiable, implying that /5(/) = 
1 - 27r2(l + o{l))p = f,-27^^-f{i+o{i))^ Q^jjgj. words, for pi sufficiently small and 
I/I < Mil we can write p{f) in the form p{f) — e^^'^ ^ +g(f)f where g{f) — ^ as 
/ ^ 0. Choose a sequence which goes to zero as n — > oo, say \ogn/y/n. We then 
define 

e„ = sup |5(/)| and w„ = min{logn, 6"^/^}. 

/:|/|<logn/V^ 
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For I/I < LOn/\/n, we then have \npg{f)\ < w^e implying that p„(/) 

g-27r^»/^+o(i) ^ g-27r^i/^^^ _,_ ^(^^-^y consequence, 



2"g„ / cos{2n faVn)pnif)df 

= 2>„/ cos(27r/aV^)e-2^'"/'d/ + o(2"g„) / e-^-'^^'d/ 

= 2"g„ / cos(27r/ay^)e-^'''"^'d/ + o(2"g„n-i/2) 

= 2"q„ / cos(27r/a^^;)e"2'^'"^'^^/ + o(l). 

By an argument similar to proof of H3.23|l . one can show that 
lim / 2"g„ cos{27rfaV^)e-^''^''f^df = 0, 



(3.25) 



implying that 



hm E 

n — >OQ 



/oo 
cos(27r/aV^)e-2-'»/'(i/ 
-oo 

= lim 2"q„^^e"^"' =7. 
"^00 \/2TTn 



(3.26) 



This completes the proof of equation (|3.23|) . □ 

3.4. Higher Moments. In this subsection, we analyze the higher moments. Bear- 
ing in mind the similar structure of the representations (|3.7f) and (|3.9|l , we first con- 
sider the one-dimensional factorial moments E[(Z„(a^, b^))k]- As in Subsection l3.3l 
we omit the index £, and write a„, 6„, 7 and qn for a^, 6^, 7^ and g„,f , respectively. 
We also write /(•) and /fe(-) instead of /'•^•'(•) and 

We want to show that in the limit n 00, the factorial moment E[(Z„(a„, 5„))fe] 
is equal to 7*^. Since the sum in (|3.7() contains (2")^ = 2"'^(1 + o(l)) terms, one 
might therefore try to show that K[Ik{cr^^\ . . . ,cr^''^)] is asymptotically equal to 
^k2-nk generalizing our approach from the last section, which showed that 
E[/(cr(i))] is asymptotically equal to 72^". 

But in contrast to the expectation of /(cr'^'), the expectations of the random 
variables /^(cr^^), . . . ,(t(*')) cannot be analyzed easily for all sequences of distinct 
configurations cr'-^\ . . . , cr^''\ This problem already appeared in [BCPOlj . but here 
it will be harder to overcome. First of all, even the analog of Lemma 13.21 for the 
higher moments will not hold unless the configurations (t^^\ . . . ,cr(''') form a set 
of k linearly independent vectors in R". But more importantly, the expectation of 
Ik{cr^^\ . . . , cr^'^)) will be hard to analyze, even if (t^^\ . . . , cr^'^) are linearly indepen- 
dent, unless we impose additional conditions on the configurations cr(^\ . . . ,(t^''\ 

To overcome these problems, we use the following strategy: first we prove the 
analog of Lemma f3. 21 for E[/fc(<T^^\ . . . , cr^''^)] under the assumption that the con- 
figurations cr^^\ . . . , cr'^'"') are linearly independent. Then we formulate a condition 
on the sequence <t(^), . . . , cr^*^) that allows us to analyze E[/fc(<T(^\ . . . , <t'^'^))] by an 
extension of the proof of Lemma [3.31 Having extracted the leading behavior, we 
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then estimate the contributions of all other configurations cr^^), . . . ,cr('') and show 
that they do not contribute in the limit n oo. 

We start with the analog of Lemma [3. 21 for the higher moments. 

Lemma 3.4. Let k be a positive integer, and let (t^^\ .., cr^'^-' be linearly independent 
configurations in . Then 

E[/fe(a(i), . . . , rr^'^))] ^ qU J] Z^^^^) 11 cos(2^/,i„V^)4f,, 

(3.27) 

where 

k 

z;, l<s<n. (3.28) 

Proof. Let us first rewrite /^(cr'^^ . . . , tj'^^^) as 
/.(<t(^\...,<tW) 

= 1n n ^'"""^ ^^"^ ''"'■^^ n sinc(/,<7„) cos(2^/,t„VH)d/: 



5=1 j=i 



(3.29) 



In III n e'"''^'' n sinc(/,g„) cos(27r/,t„V^)d/,. 



OO „_i 



Arguing as in the proof of Lemma 13.21 we then have 

E[4(<t(i),...,<tW)] = hm E[/p^(<T(i),...,aW)], (3.30) 

where 

4^^)(^(i), . . . , ^C^)) ^q'n n ^'"""^"^ n sinc(/,(7„) cos(27r/,t„^A^)d/,. 

(3.31) 

Using again Fubini's theorem and independence of the Xi, we now get 



E[4^^)(^(i), . . . , ^C^-))] = 9^ / / / n /^C"^) n sinc(/,<z„) cos(2^/,i„ 

(3.32) 

To continue, we will use the fact that cr^^^ . . . , cr^*^) are linearly independent, im- 
plying that the matrix Mk = [o'i'''']j<k,s<n has rank k. Relabeling, if necessary, 
let us assume that [cTi^jjKk, ■ ■ ■ , [^k^]j<k form a basis of the row space. Holder's 
inequality, the fact that |p(i's)| < 1, and a change of variables from /i, . . . , /fe to 
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vi, . . . ,Vk then leads to the bound 

n k 

'df: 



s=l j=l 

pep CO ^ X / f 1 ^ 

<(/// ni«i^^(/^'?«)i'^'^/^) (///^~' |ni^(«^)i"''n^/^ 

n |sinc(/,g„)r^rf/.) [Jk \l[\p{vs)rdv. 



< oo, 



(3.33) 

where uq and e are as in the proof of Lemma 13.21 and Jk is the Jacobian of the 
change of variables from /i, . . . , to vi, . . . ,Vk. By dominated convergence, we 
can therefore take the limit i? ^ oo in H3.33|l . Putting everything together, this 
gives □ 

Next we would like to prove that for a "typical set of configurations" cr^^^ , ...,cr^''\ 
the integral on the right hand side of H3.27|l is equal to (27rrt)~'^/^e~'^" /^(l + o(l)). 
Here the meaning of typical is best formulated in terms of the matrix formed by 
the row vectors cr^^^, . . . , crC'). More generally, for u < k and cr^^^, . . . , cr^") e 
{— let AIu be the matrix with matrix elements <Ti''\ where 1 < j, s < u. 
Given this matrix and a vector 5 € {—1, 1}", let 

=n,(<T(i),...,<T(«)) = \{j < n : (af \ = ^}| (3.34) 

be the number of times the column vector S appears in the matrix Af„. 

If one were to choose configurations (t^^\ . . . ,cr(") G { — independently 
and uniformly at random, then for all S E {—1, +1}", the expectation of ns is clearly 
equal to n2~'^. By a standard Martingale argument, for most configurations, the 
difference between ns and 7i2^" is then not much larger than see Lemma f3. 81 
below. Let us therefore assume for the moment that 

max|n5(T«,...,rT(«))-^| < V^A„ (3.35) 
S 2" 

for some A„ — s- oo to be chosen later. The next lemma shows that under this 
condition, the right hand side of H3.27|l behaves as desired, provided A„ is chosen 
appropriately. For concreteness, we chose A„ = logn, even though the proof works 
for much larger class of sequences. 

Lemma 3.5. Let A„ ~ logn, let k be a positive integer, and let cr^^', . . . ,cr('^' be a 
sequence of configurations of rank k that satisfies (|3.35|l . Then 

2"'=E[4(<t(i),...,<tW)] =7^^+0(1), (3.36) 

where the constant implicit in the o- symbol depends on k. 

Proof. In view of Lemma 13.41 we will have to estimate the expression 

p C poo ^ 

2"'g' / / / n '°(^-) n sinc(/,<Zn) cos(27r/,t„V^)d/,. (3.37) 
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Let /ii, ci, C2 and uJn be as in the proof of Lemma |3.3I In a first step, we want 
to show that the contribution of the region where \vs\ > /ii for at least one s is 
neghgible. 

Thus consider the event that one of the |ws|'s, say \vt-^\, is larger than fii. Let 
= {<7ti\ •■jO'tf^}: and let d^, ...,S'' be vectors such that the rank of {d^, ■■,S''} is 
k. Let {vt2, ■■■,vtk} be defined by 

Since the vectors {<5^, .., S'^} have rank fc, we can change the variables of integration 
from fj to vtj . Let the Jacobian of this transformation be Jk ■ The Jacobian Jk is 
bounded above by the largest determinant, Jmax, of a matrix of size k whose entries 
are ±1. We now bound the integral over the region where |utj > /ii as follows: 

n k 

Jk Y\_ Pi'^s) Y\_ sinc(/j(7„) cos{2-K fjtn\/n)dvt^ 

oo "'|Dti |>A'l s=l j=l 

<J„^a. fir f[\pivtX''dvt^x [ IpivuT^'dvt, (3.38) 

Since only grows like a power of n while the number of choices for 8*^ is 

bounded by 2^ and ngi — n2'~^ + o{n) by the bound H3.35|l . we conclude that 
the contribution of the regions where at least one of the l^sl's is larger than /ii is 
exponentially small in n. 

Consider now the region where all |ws|'s are bounded by /ii. In this region, we 
again would like to approximate the sine factors in (|3.37|) by one. To this end, we 
first note that 

k 



|/,| =max|7;,|. (3.39) 

Indeed, by the triangle inequality, we clearly have that maxg \ vs\ < X]j l/jl- To 
prove the opposite inequality, we use that ns = ^(1 + o(l)) > for every 5 £ 
{— 1,+1}", implying that there exists a Vgg that is evaluated as \ fj\- 

region where all | Ws I's are bounded by /ii , we therefore have that all fj 's are bounded 
by fii, so that smc{qnfj) = 1 + 0{q^). By the fact that tny/n = ay/n + 0{qn), 
we furthermore have that cos{2tt f jtny/n) — cos(27r /ja^/n) + 0{qn)- Replacing the 
sine factors by 1, and the product of cos(2tt fjtn^/n) by cos(27r /jay^), we therefore 
obtain an error which can be bounded by 2"'^g^O((7„), an error which again goes 
to zero exponentially in n. 

We thus have show that, up to an error which is exponentially small in n, the 
left hand side of (|3.36() is equal to 

2""9' n ^(««) n cos(2^/,aV^)d/,. (3.40) 
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ti 

\vt^\>U>^/y^ (3.41) 



Next we show that we can further restrict the range of integration to \vs\ < 0Jn/\/n 
for all s. To this end, let us consider the integral where w„ < < /ii, while Vs 
can be an arbitrary number in [— /^i, /^i] for all other s. We will then have to bound 
the integral 

J J Ja)„/Vn<|fti|<A'l s=l j=l 

Using the fact that p{v) < e^^^"^ foj- |^| < ^^^^ tj-^jg can be easily accomplished, 
leading to the bound 

J J J —oc j—2 

Using the facts that w„ 00, ngi/n = 2^^ + o(l) and 2'^"(j^ = 0{n''/'^), this 
implies that over all 2"" sequences of configurations cr(^\ . . . , cr^") G {—1) the 
contribution of the regions where {vs-^ \ is larger than ojn/ \/n is negligible. However, 
since there are at most 2*^ different possibilities for \vs\, we see that the contribution 
of the regions where any one of the |ws|'s is larger than ojn/y/n is negligible. 

For \vs\ < iOnl\fn^ we approximate /5(i's) by/5(i's) = exp(— 27r^u^+o(l/n)). Using 
the shorthand ns for the quantity n<5(cr(^\ . . . , tr^*^^), and defining as X]j=i '^j/jj 
we then rewrite 

n 

n = n '^(^■5)"' = cxp(-2^2 ^ ^^^2 ^ 

We would like to approximate the sum in the exponent by p = X]^=i /J- To this 
end, we first note that 

E Y.^n!n^nJn= E E /J = ^'^ E /J = ^V^- 
<5e{-i,+i}'= ji J2 (5e{-i,+i}'= J J 

If rig was egwa/ to 2r^n for all ^, the sum in the exponent would therefore be equal 
to but for general 8 we get the bound 

^ngvl-nf = ^{ns-2-^n)vl < f maxjna - 2-'=?i| j E ^'i (3.42) 

Using the condition H3.35|l . and the fact that \vg\ < ujn/V^^ "^"^ bound the right 
hand side by Xnl^uj^j \fn = o(l). We thus have shown that for \vs\ < 

n 

[] pivs) - (1 + o(l)) exp(-2^2^/2 

s=l 

Combining the bounds proven so far, we conclude that, up to an error which is 
negligible as n — > c», the expression in H3.37|l is equal to 

h 

2""gf, /// n cxp(-27r2n/2) cos(27r/,aV^)d/„ (3.43) 

where the integral goes the region where \vs\ < uJn/^/n for all s. Since, by an 
argument very similar to the argument leading to (j3.38|l and H3.41(l . the integral 
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of n^=i '2^P(~27r^n/|) over a region in which \vs\ > cj„/^/n for at least one s is 
negligible, we therefore have shown that 

2""E[/fe(<T(i),...,rT«)] 

r f- poo 

= S""?,"; / / / n exp(-27r2n/2) cos(27r/ji„ + o(l) (3.44) 

J J J —OO j—l 

= 7' + o(l), 

as desired. □ 

As we will see below, the number sequences of configurations (t^^\ . . . , cr^''^ that 
are linearly independent and satisfy the bound (|3.35() is 2"'^(1 + o(l)). Restricting 
the sum in H3.7|l to these configurations and using Lemma 13.51 to estimate the 
expectation of Ik{cr^^\ . . . , cC^))], we get a contribution to the fc**^ factorial moment 
that is asymptotically equal to 7^^. To prove Theorem 13.11 we have to bound the 
contribution of the remaining terms. To this end, we first establish an upper bound 
on the expectation of /fc(<T(^\ . . . , cr^'^^)] that does not rely on the condition H3.35|l . 
To formulate this bound, we introduce the following notation. 

Definition 3.6. Let no be such that 1/no + 1/(1 + e) ~ 1 where e is the constant 
from assumption (|2.1I) . We say that the configurations cr'^^^ (t*^"' has riQ-rank 
Mo if the maximum number of linearly independent column vectors S G { — 1,+1}" 
such that 

n5((T(i),...,cr("))>no (3.45) 

is equal to uq. 

Lemma 3.7. Given a positive integer u, there exists a constant Cu such that for all 
sets of linearly independent row vectors (T^^\...,(T^'^'^ G {-1,+1}" that have UQ-rank 
Uq, we have 

E[/„((T(i),...,cr("))] < C„C+^"""''^^"°- (3.46) 



Proof. Let As C {1, . . . ,n} be the set of indices i such that the column vector 
{<J^^\ . . . , o'j"'') is equal to S, and let Yg be the random variable 

ieAs 

Recalling the definition (|3.8|l . we then rewrite /^(cr^^^ . . . , cr^")) as 

/„(<t(i),...,^("))= (3.48) 

= 2-" E nrect(&^^^^^^^^^^) 

^e{_i,+i}.j=i 

where A is the set of vectors d G {— 1,+1} such that ns > I. 

Choose u linearly independent vectors S''^\ . . . , G A such that the vectors 
S^'^\ satisfy the condition JTIHll . Let Aq = {6^^\ 5^"°'}, and let 

All = . . . , d^"^}. Denoting the /c-fold convolution of p with itself by pk, we 

then write the expectation of a typical term on the right hand side of (|3.48() as 

.EaeA 1 s J _ ^ ^ ^ Ku{yA\Ao) Yl Pnsiys)dys (3.49) 



'[n '''^''^(" 



qn 
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where yA\Ao is a shorthand for the collection of variables ys, 6 G A \ Aq, and 

KuiyA\Ao) is the integral 

if.(2/A\Ao)= /// nrect(^^^Aiz^^l^Z^) p^dys. 

j=i 6eA„ 

Combining the relations (|3.48|) and H3.49|l and observing that 

n Pnsiys)dys^l 

<5eA\A„ 

by the fact that ns > 1 for all ^ e A, we clearly have that 

E[/„(cr(i),...,cr("))] < sup Ku{yA\Ao)- (3-50) 

It is therefore enough to bound Ku{y a\Ao) uniformly in j/a\Ao- 

Let aj = Tjtn^/n — J2seA\A^^jys- Noting that aj does not depend on the 
variables which are integrated over in K^, we then rewrite K^^ as 

Ku{yA\Ao)= III nrect( ^'^^^" -) Jl Pns{ys)dys 

Let M be the matrix with matrix elements Mji = 5'"^\ The product of the rect- 
functions in the above integral then ensures that 

I " 1 
max \ S^Mjiyg^i)-aJ < -g„. (3.51) 

3 = 1,. ..,u\^ 2, 
z— 1 

Since the vectors in A„ are linearly independent, the matrix M is invertible. Let 
/3i = X]j'=i(-^~^)uO!ij ^'^'^ ||M^^|| be the norm of as an operator from l^^ 
to ^oo- The bound 1)3. 51|) then implies that 



max 



2/5(0 - A < (3.52) 



where g„ = ||M -"^Hgn. As a consequence, the integral is bounded by 

Ku{yA\A,)< ///nrect( ^'^'T^' ) n P^Mdys 
i=i 5eA„ 

= ri / rect(^^^)p„.(y)dy (3.53) 
= Cn/ p"'(/)sinc(9„/)e2"'^"/d/, 

i— 1 

where we used the shorthand rii = For i = 1, . . . , uq, we use that rii > uq to 
bound the integral on the right by 

p"»(/)sinc(g„/)e2"^'-^d/< / |p"«(/)|d/ = Co, (3.54) 
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while for z = uq + 1, . . .u, we use rii > I and Holder's inequality to obtain the 
bound 



p"-(/)sinc(g„/)e2"'^"^d/< / |p(/) sinca/)|d/ 



< 



nf)\df '^'""\ I |sinc(g„/)|i+^d/ 
r ;t-i/(i+«) 



l/(l+<=) 



(3.55) 



Here 



Co 



l/no r 



r'ifM / |sinc(/)|^+^d/ <(X3 (3.56) 



1/(1+e) 



is independent of m, uq and n. Observing that 1 — 1/(1 + e) = l/no, we thus get 
the bound 

Ku{y A\Ao) < (maxCo, Co)>;?+("""»)/"°. (3.57) 
Since there is only a finite number of choices for a set A of w linearly independent 
vectors in {— 1,+1}", the ratio (jn/qn = is bounded by a constant that 

depends only on m, implying the existence of a constant C„ such that 

ifn(yA\Ao) < aC + '"""°^/""- (3.58) 

Combined with lpnH|l and (pO^ . this proves the lemma. □ 

In order to bound the contribution in equation 13 .71) coming from the terms where 
the vectors S''^\ have rank u < k or do not satisfy condition (|3.35|) . wc need 

the following lemma, whose main statements were already proven in BCPQl . 

Lemma 3.8. 

(1) Given u < k linearly independent row vectors cr^^\ . . . , cr^'^\ there are 
at most 2"('''~") ways to choose cr("+-'^\ . . . , cr^*^) such that the matrix M 
formed by the row vectors (T^^\ . . . , cr^*^) has rank u. 

(2) Given u and hq, there are constants C3 — €3(1*, no) and C3 = C3(it, no) 
such that there are at most C^n'^-^2'^^° ways to choose u linearly independent 
configurations cr^^\ ...,cr(") that have no-rank uo- 

(3) Let u < 00, let C4 = £4(7/) = 2^~^^ , and let A„ be a sequence of positive num- 
ber such that Xn/\/n as n ^ 00. Then the number of configurations 
(t(^\ cr(") that violate condition (|3.35|l is bounded 6y C42""e^5-^" . 

(4) Let (t(^\ ...,cr('^) be distinct spin configurations, assume that rank M < k, 
and let cr'^^\ . . . , cr'^^^ be linearly independent. Then ns{(T^^\ cr'''^^) = for 
at least one S G { — 1,+1}", implying in particular that, for n sufficiently 
large, cr^^), cr^") violate condition (13.35(1 . 

(5) Given u, let (t''^\ ...,ct''^'^ G {— 1,+1}" be an arbitrary set of row vectors 
satisfying ((3.35|1 . Then 

g(cr('^),crW) < (3.59) 
Vn 

whenever a ^ b. For n sufficiently large, condition H3.35|l therefore implies 
that (T^^\ ...,cr(") are linearly independent. 

Proof. Except for the second statement, the lemma mainly summarizes the relevant 
results from Section 6 of IBCPDll . More explicitly: statement (1) is proved in the 
paragraph following (6.10), and for A = logn, statement (3), is proved in the 
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paragraphs around (6.12), statement (4) is proved in the paragraph around the 
second and third unnumbered equation after (6.12), and statement (5) is equivalent 
to the bound (6.14). 

It is not hard to see that the arguments in Section 6 of [BCPOlj can be generahzed 
to arbitrary sequences A„ of positive number, as long as \nl\pn — > as n — s- oo. 
Indeed, starting with statement (3), let us consider n independent trials with 2" 
equally likely outcomes, and use Chebychev's inequality to bound the probability 
that |n5(cr(^), cr^")) — n2~^\ > y/nXn by 2e~2^,i. Combined with the union 
bound for the 2" different random variables ns{cr'-^^\ cr^"'), 5 € {—1, +1}", this 
gives statement (3). The first part of (4) does not involve the value of A„, and the 
second follows from the first whenever Xn/y/n — > as n oo. To prove the bound 
in statement (5), we rewrite the overlap q{cr^°-\cr'^''^) as 

(a) rr(b) \ - 



^' ns{cT^'\...,cT(-^)- ns{cT^^\...,<T^''^)). (3-60) 

Se{-i,+iV <5e{-i,+i}" ^ 

Noting that each sum contains 2"^^ terms, we see that the bound (|3.35|1 implies 
the bound 13.59|l . Finally, the last statement of (5) is a direct consequence of (|3.59f) 
and the fact that A„/ ^/n — > as n — > cxi. 

We are left with the proof of (2). To this end, let us consider the matrix Mu 
obtained from M„ by omitting all columns (J^^\ . . . , tJj-"^ that are equal to a vector 
5 £ {— 1,+1}" with ns{cr'-^\ ...,cr(")) < hq. Note that the number of columns n' of 
Mu is at least n — 2"no and at most n. Fixing n', for the moment, and noting that 
the rank of Mu is uq, we now use statement (1) to conclude that there are at most 



Uq 



< 2"*+" /^2""° (3.61) 



ways to choose Mu- Given Mu we need to insert n — n' columns in {—1, +1}" to 
obtain the matrix Mu- Including the number of choices for the positions of these 
n — n' columns, this gives an extra factor of 

" ]2(»-"')«<^ L^„"-«'2»-«' < L^^"o2-2"o2" /3 g2) 

n — n' J [n — n'y. [n — n')\ 

Combining the two factors and summing over n' G {n — no2", . . . ,n}, we get a 
bound of the form C3n''^2^°" where C3 and C3 depend only on u and ng. □ 

Having Lemmas 13.41 13.51 13. 71 and 13. 81 in hand, we are now ready to prove Theo- 
rem 

3.4.1. Proof of Theorem \S.l[ We start with the case m = 1, i.e., the one-dimensional 
factorial moment E[(Z„(a„, 6„))fc]. As in Lemma 13.51 we choose A„ = logn. Con- 
sider the sum over all sequences cr'-^\ . . . , cr'-''^ e {^1, +1}" that satisfy the bound 
(I3.35|l . By Lemma|^Bl(3), this sum contains 2"'^(l-f o(l)) terms, and by Lemma|^HI 
(5), the matrix formed by the row vectors cr^-^\ . . . , crC^' has rank if n is large 
enough. With the help of Lemma 13.51 we conclude that the sum over all these 
terms gives a contribution to the fc**^ factorial moment which is equal to 7'^ + o(l). 

To prove Theorem l3.1l we have to bound the contribution of the remaining terms. 
To this end, we group the remaining terms in the sum 1)3. 7|l into four classes: 
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(1) Sequences of distinct configurations (t(^\ . . . ,(t('') of rank k and no-rank 
uq < k that violate the condition (|3.35(l : 

(2) Sequences of distinct configurations cr^^\ . . . , cr^*^'' of rank k and no-rank k 
that violate the condition H3.35|l : 

(3) Sequences of distinct configurations cr(^\ . . . , cr^*^) of rank u < k such that 
there is a subsequence of linearly independent configurations cr^^-', . . . , cr*-"-' 
of no-rank uq < u; 

(4) Sequences of distinct configurations cr(^\ . . . , cr^*^) of rank u < k such that 
all subsequences of linearly independent configurations a-^^\ . . . , o-'-"-' have 
of no-rank uq = u; 

By Lemma 13.81 (4). the configurations cr'^-' , . . . , cr^"' in class (4) must violate 
condition H3.35|l . Relaxing the constraint that the configurations in class (1) violate 
condition (|3.35|l . it is therefore enough to bound the following two error terms: 

• the sum ^ of all sequences of configurations <t(^\ . . . , cr'*^) of rank u < k 

containing a subsequence of linearly independent configurations cr^^\ . . . , cr^"-* 
of no-rank uq < u, and 

• the sum i?^ of all sequences of configurations cr^^^ , ■ ■ • , t^'^' of rank u < k 
such that all subsequence of linearly independent configurations cr^^-' , . . . , cr*-"' 
obey condition H3.35|l and have no-rank uq — u. 

Before bounding these two error terms, we note that 

k u 

h{<T^'\. . . ,<tW) = l[licT^'^) < n^(*^'') = Iui^^'\- ■ ■ , *^"^), (3.63) 
implying that 

E[4(^W,...,<T«)] <E[/„(^(i),... (3.64) 

whenever o-*-^-* , . . . , cr*-"-' is a subsequence oi (t^^\ . . . , ar^'^K 

In order to bound f., we now use (|3.64|) and Lemma l3.7l to bound the expecta- 

tion of . . . by C„g:^''+("-"°^/"'' < csg^J^g^"", where C5 = max„<fe C^. 

Using Lemma f3. 81 (2) to bound the number of sequences cr'-^-', . . . , o-*-"-* of no-rank 
Mo by C6n'^'*2""'', where Ce = max„<fc C3(u, no) and ce = maxt,<fe C3(m, no), and 
Lemma f3. 81 fl) to bound the number of ways cr'^^\ . . . , cr^*-'' can be obtained from 
a-^^\ . . . , dr^"-*, we therefore obtain the following upper bound 

i?<fc<C6C5n^« V f^)2"(^-^)(2>„)""9y"°- (3.65) 



Uq 

UQ<u<.k 



Since = 0(>/n), we get that 

<fe = 0(n^^9y"'') (3.66) 

where the constant implicit in the 0-symbol depends on fc, a and 7, and where 
C7 = Cfi + k/2. Since qn falls exponentially with n, this proves that R^ k ~ ''(I)- 

The error term R^ ^. can be bounded in a similar way. We again use (|3.64(l and 
Lemma 13.71 to bound the expectation of /fe(cr(^\ . . . ,cr('^)), but now we use part 
(3) of Lemma to bound the number of sequences cr'^\ . . . ,a-^^K Using again 
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Lemma f3. 81 (1) to bound the number of ways cr^^', . . . jCr^*^) can be obtained from 
cr'-^'', . . . , or^"-', we now obtain the upper bound 

i?=fc<C5C8 (^)2"('-")e-^"/2(2>„)« (3.67) 

UQ<u<k 

where cg = maxu < fcc4(w) = 2''+\ Using again that 2"g„ = 0{y^), we conclude 
that 

RZ,^0{n^/'e-^^^/'). (3.68) 

Since e^^^l"^ = e^'°s "/^ decays faster than any power of n, the right hand side 
goes to zero as n — s- cx), as desired. 

This completes the proof that E[(Z„(a„, 6„))fc] ^ as n ^ oo. To prove the 
convergence of the higher-dimensional factorial moments, we need to generalize 
Lemmas 13.41 13.51 and 13.71 But, except for notational inconveniences, this causes 
no problems. Indeed, comparing the representations H3.8|l and (|3.10|) . we see that 
the only difference is the appearance of several distinct intervals [al''"'\ fel''"'^] for the 
energy of the configuration cr^^\ instead of the sa.me intervcil [ttij, hji 1 for all of them. 

As a consequence, the statement of Lemma has to be modified, with the right 
hand side of H3.27|l replaced by 

n «n/ / / / n n ^^^<!,^nm) C0s(2^/,il(^) V^)d/, . (3.69) 

But the proof remains unchanged, since it never used that qn,i{j) or tn is constant. 

In a similar way, the proof of Lemma 13.51 needs only notational changes: the 
arguments leading to (|3.4U|I now give a prefactor 2"'^ Yij Qe(j) instead of T^^q^, but 
the integral multiplying this prefactor (and therefore the rest of the proof) remains 
unchanged, proving that under the conditions of Lemma 13.51 

2"^-E[/fe(^(i), . . . , ^t^-))] = Ht^O") + 0(1). (3.70) 

3 

Turning finally to the proof of Lemma ITTI we note that its proof goes through if 
we replace (j„ by max£=i_...^m (7„^£. As a consequence, the bound H3.46|l has 

to be modified to 

E[4(^«,...,^("))] <Cu{ max 9„,,)"''+("-"")/"°, (3.71) 

^— l,...,m 

which does not change the n-dependence of the bound. 

Using these generalizations of Lemmas 13.41 ITsl and 1X71 it is easy to see that the 
bounds H3.65|) and (|3.68|l remain unchanged, except for the fact that the implicit 
constants in the O-symbols now depend on max^ 7^ instead of 7. This completes 
the convergence proof for the multi-dimensional factorial moments, and hence the 
proof of Theorem 13. II 

Remark 3.9. Throughout this section, we have assumed that a is bounded. For 
convenience m our companion paper we note that the above estimates 

on ii^j. and i?^^, can be easily be generalized to growing a if we choose A„ ap- 
propriately. Indeed, making the a-dependence of our bounds explicit, we obtain 

<fc = 0(n=^e'="'/2^y"") (3.72) 
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and 

Rn,k = 0(n'^-/2efeaV2e-A^/2)^ (3.73) 

as long as A„ = o{^/n). For a = o{^\/n), qn decays exponentially in n, and R"^ = 

0(1). Choosing A„ in such a way that a = o(A„), A„ — o(y^) and e^^"^^ decays 
faster than any power of n, we also have k ~ 

In order to prove Theorem l3.1l for growing a, we therefore only need to generalize 
the statements of Lemmas 13.31 and 13.51 For a = o{n^/^), this will be done in 

|ij(jMNn5| . 

3.5. Overlap Estimates. To complete the proof of the Theorem 12.21 we need to 
show that the rescaled overlaps converge to a standard normal. Defining R{(3) to 
be the tail of the standard Gaussian, 

R{(3) = I -^e-'^dx. 



Jl3 

we therefore have to show that for any P € R and any j > z > 0, we have 

P(Qy >I3)^ R{I3) (3.74) 

as n — !■ oo. 

Let Er^+i and Er^+j be the i^^ and j"^ energy above a, respectively, and let 
Aq > 0. Having established the convergence H2.4|l of the rescaled energies, we note 
that the probability that both Er^+i and Er„+j into the interval [a, a + Ao^n] 
can be made arbitrary close to one by choosing Aq and n large enough. Consider 
further a discretization scale rj such that Xo/rj is an integer. If both E^^+i and 
Er^+j fall into the interval [a, a + Ao^n], each of them must fall into one of the 
Ao/77 intervals [a, a + 77^„], [a + 77^„, a + 277^„], . . . , [a + (70 - ?y)Cn, a + AoCn]- By 
choosing 77 sufficiently small and n sufficiently large, the probability that both fall 
into the same interval, or that one of the other energies between a and a + AoC„ 
falls into the same interval as E^^+i or Er^+j, can be made arbitrarily close to one 
as well. 

It is therefore enough to consider the intersection of the event Qij > (3, the 
event that Er^+i and Er^+j fall into two different intervals of the form [a + (m — 
ct + m?7^n], m — 1, . . . , Ao/77, and the event that both of them are the only 
energies that fall into these intervals. Denote the intersection of these events by 
Aij{[3). Decomposing the event ^ij(/3) according to the spin configurations (t(''"+') 
and (t(''"+^) corresponding to the i^^ and j**^ energy above a and the particular 
intervals containing these energies, we then rewrite the probability of Aij as 

jYiiKnij cr.cr 

(3.75) 

Here the second sum runs over pairs of distinct configurations cr, cr with rescaled 
overlap larger than (3, the first sum runs over integers nii, m,j with < rrii < rrij < 
Ao/77, the symbol Am{cr) denotes the event that the energy of the configuration 
cr falls into the interval [a + (777 — 1)77^^, a + 77i77^„], and the random variables 

(£) 

Zn are equal to the number of points in the spectrum that lie in the intervals 
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[an\bn''] where ai^' = a, 5^^' = ai^^ = a + (mi - 1)77,^„, b^'' = ai^' = a + mirj^n, 
bn ^ = Qn'' — a + (m2 — l)r]^n, and bi'*^ ~ a + m2riin- Defining as before, let 



{Zn) 



(/3) _ 



(3.76) 



be the number of distinct pairs of configurations cr, cr with rescaled overlap at least 

f2) f2l 

/3 such that the energy of cr falls into the interval [0^ ,6™ ], and the energy of cr 
falls into the interval [a[i\b'if']. We then rewrite the probability P(Aij(/3)) as 



(3.77) 



I(Z, 



(3) 



1)I(Z, 



(4) 



where 1(A) denotes the indicator function of the event A. 

Let Nn{P) be the number of distinct pairs cr, cr with rescaled overlap at least /3. 
Combining the methods of the last section with the standard central limit theorem, 
we now easily establish that 



E 



(Zn) 



(/3) 



= 7722-2"iV„(/3)( 1 + 0(1) 



7,2 i?(/3) 1+0(1) 



(3.78) 



In order to analyze the right hand side of H3.77|) we would like first to factor the 
expectation on the right hand side, and then use (|3.78|) and Poisson convergence 
of the random variables Zn to analyze the resulting terms. In the process, we will 
have to analyze the factorial moments 



E 



(3.79) 



Unfortunately, the methods of the last section cannot be directly applied to these 
factorial moments since the sum over configurations representing the above expres- 
sion is not a sum over pairwise distinct configurations: comparing, e.g., the sum 
over cr in (|3.76|) and the representation of the random variable ^ cLS db sum over 
configurations, 

z(2)=V/(2)(^'), (3.80) 



(2) ;,(2)i 



we see that both involve configurations whose energy lies in the interval [a„ , 6„ 
But this problem can be easily overcome by considering the random variables Zri 



(2) 



1 and Z^^' — 1 instead of Z^'' and Z^^-*. We therefore consider the expression 



E[(Z„)f (Z(i)),,(Z(2) - 1),,(Z(3)),3(ZW - 1),^ 



^(/3), 



cr ,cr 



/(2)(^)/W(^)(z(i)),,(z(2) - - 1).. 



(3.81) 



We claim that this expression can again be be expressed as a double sum over 
distinct configurations, allowing us to apply the methods of the last section. Indeed, 
let us first consider the product 

/(2)(<T)(42) - 1),, = /(2)(^)(42) - 1)(42) - 2) . . . (Z(2) - k,). (3.82) 

Proceeding as in the proof of H3.7|l , we now rewrite this product as a sum of con- 
figurations cr^^\ . . . jCr''^^^ which are mutually distinct and distinct from cr. In a 
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similar way, the product I^'^'> ia){zi^^ - 1)^, can be expressed as a sum over mu- 
tually distinct configurations which are distinct from <t. Using these facts, we now 
proceed as before to obtain the bound 



E 



where 71 = r]{mi — 1) and 73 = 77(7712 — 7711 — 1) 
Consider the four random variables Zn \ Zn 
with the probability distribution /i defined by 



(3.83) 



1, Z^n^ and Zi"*^ - 1, together 



(3.84) 



E 


\Zr,)^^h{Zi'^ = ll)I(Zi'^ - 


1 = ^2)1 


[(Zi') = l3)l{Z^n^ - 1 = 14)" 


E 







The bounds H3.78|l and 13.83|l then establish that in the measure the four random 
variables Zn\ Zn^ — 1, Zn'' and Zn^ — 1 converge to four independent Poisson ran- 
dom variables with rates 77, 72, rj and 74, respectively. Using once more the bound 
(|3.78|l , we conclude that the expectation in the sum in H3.77|) can be approximated 
as 



E 



(Z„)f 'l(Z(i) = ^ 1)I(Z(2) - 1 = 0)I(Z(3) =J-^- 1)I(ZW -1 = 0) 



1 j — 

„2 7l 73 ^-(2r,+y 



i+^^)i?(/3)(l + o(l) 



' (7-l)!(j--i-l)! 
Inserted into H3.77|l this gives the bound 

PiA.m ^ Kr,iXo)R{f3)[l + 0(1)] 
where 

((?71l - 1)7;)'-1 ((7772 - 7771 - l)^)^-'-^ 



0) 



V 



E 



(7-1) 



ij 



mi <m2 

is the Riemann-sum approximation to the integral 



An 



1)1 



j — i—l 

73 



(j - « - 1)! 



(3.85) 
(3.86) 

(3.87) 
(3.88) 



As 77 — > 0, the Riemann sum Kjj{Xq) converges to the integral K{Xq), and as 
Aq ^ 00, the integral K{Xo) converges to 1. Choosing first Aq large enough, then 
77 small enough, and then n large enough, the normal distribution function i?(/3) 
is therefore an arbitrarily good approximation to P(Ay(/3), which in turn can be 
made arbitrary close to V{Qij > /3), again by first choosing Aq sufficiently large, 
then 77 sufficiently small, and then n sufficiently large. This establishes (|3.74ll and 
hence the remaining statements of Theorem 12. 21 
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4. Generalizations and open problems 



4.1. Generalizations of the Npp. The Npp has a natural generalization: Divide 
a set {^1, X2, . . . , Xn} of numbers into q subsets such that the sums in all q subsets 
are as equal as possible. This is known as multi-way partitioning or multiprocessor 
scheduhng problem jBME03| . The latt er name refers to the problem of distributing 
n tasks with running times {^i, X2, ■ ■ ■ , Xn} on q processors of a parallel computer 
such that the overall running time is minimized. Bovier and Kurkova ' BK04| con- 
sidered the restricted multi-way partitioning problem where the cardinality of each 
subset is fixed to n/q. For this model they could prove the "energy part" of the 
local REM hypothesis at a = 0, i.e., the convergence of the properly scaled near 
optimal solutions to a Poisson point process. The local REM (including the "over- 
lap part") is conjectured to be valid for all a > for the multi-way partitioning 
problem in the unrestricted case (i.e., for n/q not necessarily fixed) |BFM04| . This 
generalization is still open. 

4.2. Universality. In |BM04| it is conjectured that the local REM is a property 
of discrete, disordered systems well beyond number partitioning and its relatives. 
Since this conjecture represents a fascinating open problem for the rigorous com- 
munity, we briefly review the heuristic argument of |BMfl4| : Consider a model with 
an energy function of the form 



where the cr is an n-dimensional vector with binary entries ai — ±1 or (T; G {0, 1} 
and the Xi are real random numbers from the unit interval. In case of the Npp 
(or the 1-d Edwards- Anderson model), any vector <t is a feasible configuration. If 
we add more restrictions, we could write the cost function of many optimization 
problems in the form f l4.ll) '). For example, in the traveling salesman problem, we 
would take ai € {0, 1}, where ai — 1 means that the distance Xi is part of the tour, 
and the ai would have to fulfill the constraint to encode a valid itinerary. In higher- 
dimensional spin glasses, the ai — ±1 encode satisfied or unsatisfied edges, and are 
correlated due to loops in the graph. In all cases we have an exponential number of 
valid configurations, with an exponential number of energy values E{ct). Since the 
range of energies scales only linearly with n, it should follow that adjacent levels 
will be separated by exponentially small distances. The precise value of each gap 
will be determined by the least significant hits in the X^'s, however. The dynamical 
variables ai can only control the n most significant bits of the energy. |BM04| 
argue that the residual entropy of the least significant bits then gives rise to the 
Poisson nature of adjacent energy levels and to the full local REM property. This 
very heuristic argument has been supported by extensive numerical simulations 
in various spin glass models (Edwards-Anderson model, Sherrington-Kirkpatrick 
model. Potts glasses) and in optimization problems (TSP, minimum spanning tree, 
shortest path) •BM04!. 

In this paper, we rigorously established the local REM conjecture for a particular 
model, the Npp. In a recent paper |BK05j . submitted shortly after the present 
one, Bovier and Kurkova showed that the local REM conjecture holds for many 
types of spin glasses as well, in particular the Edwards-Anderson model and the 
Sherrington-Kirkpatrick model. Their approach is based on a general theorem 



n 




(4.1) 



i=l 
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establishing Poisson convergence for an abstract class of models, with conditions 
that are very similar to the statements of our Lemmas IH.5I and 13.81 in an abstract 
setting. 

4.3. Phase Transition. According to the heuristic argument above, the bit-en- 
tropy of the disorder Xi is the essential property that leads to the local REM: if 
it is larger than the entropy of the configurations, the local REM should apply. If 
it is lower than the configurational entropy, the distances between adjacent energy 
levels are multiples of a fixed, smallest distance. In this case, each energy level is 
populated by an exponential number of configurations. An indicator for the transi- 
tion between the two regimes is the maximum overlap between two configurations 
with adjacent energy levels. If the entropy of the disorder is larger than the con- 
figurational entropy, this overlap should be (the local REM). If the entropy of 
the disorder is much smaller than the configurational entropy, this overlap should 
be 1 — Q{n~^). Numerical simulations in |iBM04) indicate that there is a sharp 
transition at the point at which these entropies are the same. A canonical problem 
in which such a transition has been rigorously investigated is the phase transition 
of the Npp |BCP01| . |BM04| propose that a transition of this type may be as 
universal as the local REM. A proof of the universality of this transition poses yet 
another challenge for the rigorous community. 
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